A Syntax-Free Approach to Japanese Sentence Compression
نویسندگان
چکیده
Conventional sentence compression methods employ a syntactic parser to compress a sentence without changing its meaning. However, the reference compressions made by humans do not always retain the syntactic structures of the original sentences. Moreover, for the goal of ondemand sentence compression, the time spent in the parsing stage is not negligible. As an alternative to syntactic parsing, we propose a novel term weighting technique based on the positional information within the original sentence and a novel language model that combines statistics from the original sentence and a general corpus. Experiments that involve both human subjective evaluations and automatic evaluations show that our method outperforms Hori’s method, a state-of-theart conventional technique. Because our method does not use a syntactic parser, it is 4.3 times faster than Hori’s method.
منابع مشابه
Japanese Sentence Compression with a Large Training Dataset
In English, high-quality sentence compression models by deleting words have been trained on automatically created large training datasets. We work on Japanese sentence compression by a similar approach. To create a large Japanese training dataset, a method of creating English training dataset is modified based on the characteristics of the Japanese language. The created dataset is used to train...
متن کاملA Proper Treatmemt Of Syntax And Semantics In Machine Translation
A proper treatment of syntax and semantics in machine translation is introduced and discussed from the empirical viewpoint. For EnglishJapanese machine translation, the syntax directed approach is effective where the Heuristic Parsing Model (HPM) and the Syntactic Role System play important roles. For Japanese-English translation, the semantics directed approach is powerful where the Conceptual...
متن کاملA Generic Sentence Trimmer with CRFs
The paper presents a novel sentence trimmer in Japanese, which combines a non-statistical yet generic tree generation model and Conditional Random Fields (CRFs), to address improving the grammaticality of compression while retaining its relevance. Experiments found that the present approach outperforms in grammaticality and in relevance a dependency-centric approach (Oguro et al., 2000; Morooka...
متن کاملTransition and Parsing State and Incrementality in Dynamic Syntax
This paper presents an implementation of a gramar of Dynamic Syntax for Japanese. Dynamic Syntax is a grammar formalism which enables a parser to process a sentence in an incremental fashion, establishing the semantic representation. Currently the application of lexical rules and transition rules in Dynamic Syntax is carried out arbitrarily and this leads to inefficient parsing. This paper prov...
متن کاملAn Investigation of Exclamatives in English and Japanese: Syntax and Sentence Processing
Title of Dissertation / Thesis: AN INVESTIGATION OF EXCLAMATIVES IN ENGLISH AND JAPANESE: SYNTAX AND SENTENCE PROCESSING Hajime Ono, Ph.D, 2006 Dissertation / Thesis Directed By: Professor Howard Lasnik, Department of Linguistics This dissertation is a case study of the syntax of the left periphery, using exclamatives in English and Japanese. In the first part, I discuss exclamatives in Japanes...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009